# Multimodal Video QA
Videollama2 7B 16F Base
Apache-2.0
VideoLLaMA 2 is a multimodal large language model focused on enhancing spatio-temporal modeling and audio understanding in video comprehension.
Text-to-Video
Transformers
English
DAMO-NLP-SG
64
2
© 2025
AIbase